A Canonical Context-Preserving Representation for Open IE: Extracting Semantically Typed Relational Tuples from Complex Sentences

نویسندگان

چکیده

Modern systems that deal with inference in texts need automatized methods to extract meaning representations (MRs) from at scale. Open Information Extraction (IE) is a prominent way of extracting all potential relations given text comprehensive manner. Previous work this area has mainly focused on the extraction isolated relational tuples. Ignoring cohesive nature where important contextual information spread across clauses or sentences, state-of-the-art IE approaches are thus prone generating loose arrangement tuples lack expressiveness needed infer true complex assertions. To overcome limitation, we present method allows existing enrich their output additional meta information. By leveraging semantic hierarchy minimal propositions generated by discourse-aware Text Simplification (TS) approach presented Niklaus et al. (2019), propose mechanism semantically typed source sentences. Based novel type output, introduce lightweight representation for form normalized and context-preserving It extends shallow predicate-argument structures capturing intra-sentential rhetorical hierarchical relationships between In way, context extracted preserved, resulting more informative coherent which easier interpret. addition, comparative analysis, show benefits second dimension: canonical structure simplified sentences process analyze, facilitates tuples, an improved precision (up 32%) recall 30%) large benchmark corpus.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detecting Opinionated Sentences by Extracting Context Information

In this paper, we briefly describe several experimental methods to solve MOAT at NTCIR-7. In the subtask of opinionated sentence detection, two methods aiming to extract the context information of each sentence are proposed. Maximum Entropy model is used to predict the polarity class. A rule-based pattern matching scheme is devised to find topic-relevant sentence. For the subtask of detecting h...

متن کامل

StreamTX: extracting tuples from streaming XML data

We study the problem of extracting flattened tuple data from streaming, hierarchical XML data. Tuple-extraction queries are essentially XML pattern queries with multiple extraction nodes. Their typical applications include mapping-based XML transformation and integrated (set-based) processing of XML and relational data. Holistic twig joins are known for the optimal matching of XML pattern queri...

متن کامل

Demonyms and Compound Relational Nouns in Nominal Open IE

Extracting open relational tuples that are mediated by nouns (instead of verbs) is important since titles and entity attributes are often expressed nominally. While appositives and possessives are easy to handle, a difficult and important class of nominal extractions requires interpreting compound noun phrases (e.g., “Google CEO Larry Page”). We substantially improve the quality of Open IE from...

متن کامل

Open Language Learning for Information Extraction

Open Information Extraction (IE) systems extract relational tuples from text, without requiring a pre-specified vocabulary, by identifying relation phrases and associated arguments in arbitrary sentences. However, stateof-the-art Open IE systems such as REVERB and WOE share two important weaknesses – (1) they extract only relations that are mediated by verbs, and (2) they ignore context, thus e...

متن کامل

Complex Lexico-syntactic Reformulation of Sentences Using Typed Dependency Representations

We present a framework for reformulating sentences by applying transfer rules on a typed dependency representation. We specify a list of operations that the framework needs to support and argue that typed dependency structures are currently the most suitable formalism for complex lexico-syntactic paraphrasing. We demonstrate our approach by reformulating sentences expressing the discourse relat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Knowledge Based Systems

سال: 2023

ISSN: ['1872-7409', '0950-7051']

DOI: https://doi.org/10.1016/j.knosys.2023.110455